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DETAILED ACTION 

1 . This action is responsive to communications: Amendment filed on 12/4/05. 

2. Claims 1 - 48 are pending in the case. Claims 1 , 26 and 37 are independent. 

3. The objection to the Drawings has been withdrawn as necessitated by 
amendment. 

4. The rejection of claims 1 - 14, 26 - 47 under judicially created doctrine of double 
patenting has been withdrawn as necessitated by a Terminal Disclaimer. 

5. The rejection of claims 9, 1 1 , 33, 34, 44, and 45 under 35 U.S.C. 1 1 2, second 
paragraph as being indefinite has been withdrawn as necessitated by amendment 

6. The rejection of claims 1 - 48 under 35 U.S.C. 103(a) as being unpatentable has 
been withdrawn as necessitated by amendment. 

Terminal Disclaimer 

7. The terminal disclaimer filed on 12/14/05 disclaiming the terminal portion of any 
patent granted on this application which would extend beyond the expiration date of 
10/108587 has been reviewed and is accepted. The terminal disclaimer has been 
recorded. 

Claim Rejections • 35 USC § 101 

8. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

9. Claims 1 - 48 are rejected under 35 U.S.C. 101 because the claimed invention is 
directed to non-statutory subject matter. The claims are simply methodologies for 
assembling data and thus do not produce a concrete, useful, and tangible result. 
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10. Further, to expedite a complete examination of the instant application the claims 
rejected under 35 U.S.C. 101 (nonstatutory) above are further rejected as set forth 
below in anticipation of applicant amending these claims to place them within the four 
statutory categories of invention. 

Claim Rejections - 35 USC §112 

11. The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

12. Claims 1 -48 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

13. The term "potentially" in claim 1 is a relative term which renders the claim 
indefinite. The term "potentially" is not defined by the claim, the specification does not 
provide a standard for ascertaining the requisite degree, and one of ordinary skill in the 
art would not be reasonably apprised of the scope of the invention. Consequently, the 
metes and bounds of "linking to a candidate document page potentially part of the 
hyperdocument" are unclear. 

14. The term "potentially" in claim 26 is a relative term which renders the claim 
indefinite. The term "potentially" is not defined by the claim, the specification does not 
provide a standard for ascertaining the requisite degree, and one of ordinary skill in the 
art would not be reasonably apprised of the scope of the invention. Consequently, the 
metes and bounds of "linking to a candidate document page potentially part of the 
hyperdocument" are unclear. 
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15. The term "typical" in claim 1 is a relative term which renders the claim indefinite. 
The term "typical" is not defined by the claim, the specification does not provide a 
standard for ascertaining the requisite degree, and one of ordinary skill in the art would 
not be reasonably apprised of the scope of the invention. Consequently, the metes and 
bounds of "not characteristic of typical intra-document links" are unclear. 

16. The term "typical" in claim 26 is a relative term which renders the claim indefinite. 
The term "typical" is not defined by the claim, the specification does not provide a 
standard for ascertaining the requisite degree, and one of ordinary skill in the art would 
not be reasonably apprised of the scope of the invention. Consequently, the metes and 
bounds of "not characteristic of typical intra-document links" are unclear. 

17. The term "typical" in claim 37 is a relative term which renders the claim indefinite. 
The term "typical" is not defined by the claim, the specification does not provide a 
standard for ascertaining the requisite degree, and one of ordinary skill in the art would 
not be reasonably apprised of the scope of the invention. Consequently, the metes and 
bounds of "not characteristic of typical intra-document links" are unclear. 

18. Regarding dependent claims 2 - 25, 27 - 36 and 38 - 48, the claims are rejected 
for fully incorporated the deficiencies of the base claim(s) from which they depend. 

Claim Rejections - 35 USC § 103 

19. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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20. Claims 1 - 6, 1 0, 1 2 - 1 8, 26 - 30, 35 - 41 , and 46 - 48 are rejected under 35 
U.S.C. 103(a) as being unpatentable over Bharat et al. (US 61 12203 A) and in further 
view of Earl (US 5924104 A). 

21 . Regarding independent claim 37, 

a. Bharat et al. teach that we locate pages that point to at least one of the 
pages in the start set 201. We call this set of pages the back set 202. With the 
AltaVista search engine, "link: URL" queries can be used to identify back set 
pages for each start set page. We add one node 212 to the n-graph 21 1 for each 
page of the back set 202. Similarly, the pages pointed to by the start set 201 are 
located. This can be done by fetching each start set page and extracting the 
hyperlinks in each of the pages. The pages pointed to by the hyperlinks 
constitute the forward set 203. Nodes for the forward set of pages are also 
added to the n-graph 211. Thus, the input set of pages 204 includes the back, 
start, and forward sets 201-203. The input set 204 includes pages which do not 
directly satisfy the query, i.e., pages that do not include key words exactly as 
specified in the query. However, these pages may be useful because they are 
linked to pages of the start set. A larger n-graph 211 can be constructed by 
repeating this process for the back and forward sets 202-203 to add more 
indirectly linked pages. At this stage, the n-graph 211 has nodes 212 but no 
edges. After we have constructed the nodes 212, we add the directed edges 
213. If a link points to a page that is represented by a node in the graph, and 
both pages are on different servers, then a corresponding edge 213 is added to 
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the graph 211. Nodes representing pages on the same server are not linked. 
This prevents a single Web site with many self-referencing pages to unduly 
influence the outcome. This completes the n-graph 211 (Column 4, line 61 - 
Column 5, line 20), compare with performing a page-level link analysis that 
identifies those hyperlinks on a page linking to a candidate document page 
further comprising a methodology of: identifying possible progression 
links; identifying possible table of content links, and; examining the 
possible progression links and the possible table of content links for 
common characteristics; performing a recursive application of the page- 
level link analysis to the linked candidate document page and any further 
nested candidate document pages thereby identified, until a collective set 
of identified candidate document pages is assembled, 
b. Bharat et al. do not explicitly teach performing a document-level 
analysis that examines the collective set of identified candidate document 
pages for grouping into one or more documents; examining the collective 
set of identified candidate document pages to weed out links which have 
properties that are not characteristic of typical intra-document links, to 
provide a resultant set of identified candidate document pages; and 
grouping the content found in the resultant set of candidate document 
pages into a document representation for subsequent viewing or printing 
of the given hyperdocument. 
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c. However, Earl teaches that a link display manager 300 is illustrated in 
functional block diagram form. The link display manager 300 includes a user 
input processor 302 for processing user link selections as indicated at lines 
labeled MOVING WITHIN CURRENT DOCUMENT and SELECTING NEW 
DOCUMENT The link display manager 300 includes a document parser 304 for 
parsing each document and identifying links 202 and 204 and a display system 
for defining predetermined screen element properties providing visual cues for 
distinguishing the identified links 202 and 204. When a user provides an input 
link selection to select a new document, the document parser 304 parses the 
selected new document to identify intradocument links 202 and interdocument 
links 204. The display system 306 processes the identified intradocument links 
202 and interdocument links 204 for displaying distinctively the intradocument 
links 202 and interdocument links 204 with predetermined visual cues to 
differentiate the links 202, 204 (Column 2, line 59 - Column 3, line 9), compare 
with performing a document-level analysis that examines the collective set 
of identified candidate document pages for grouping into one or more 
documents; examining the collective set of identified candidate document 
pages to weed out links which have properties that are not characteristic of 
typical intra-document links, to provide a resultant set of identified 
candidate document pages; and grouping the content found in the 
resultant set of candidate document pages into a document representation 
for subsequent viewing or printing of the given hyperdocument. 
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d. It would have been obvious to one of ordinary skill in the art at the time of 
the invention to combine the invention of Bharat et al. with that of Earl because 
such a combination would provide the users of Bharat et al. with an improved 
method and apparatus for displaying links on a user display interface in a 
computer system (Column 1 , lines 39 - 41 ). 

22. Regarding dependent claims 38 - 41 , Bharat et al. teach that the nodes in the 
start set are first scored according to their connectivity, and the number of terms of the 
query that appear as unique sub-strings in the URL of the represented documents. The 
score is a weighted sum of the number of directed edges to and from a node and the 
number of unique sub-strings of the URL that match a query term (Column 3, lines 10 - 
15), compare with the page-level link analysis includes examination of contextual 
clues, the contextual clue is a particular class of content item associated with the 
hyperlink, the class of content item is a class of text, the class of text is a 
directional word or phrase. 

23. Regarding dependent claim 46, Bharat et al. teach that we assign a similarity 
weight to each node 213 of the sub-graph 255. Various document similarity measuring 
techniques have been developed in Information Retrieval to determine the goodness of 
fit between a "target" document and a collection of documents. These techniques 
typically measure a similarity score based on word frequencies in the collection and a 
target document (Column 6, lines 51 - 57), compare with the contextual clue is the 
similarity of the hyperlink destination to that of other hyperlinks within the 
hyperdocument. 
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24. Regarding dependent claim 47, Bharat et al. teach that we use a modified 
Kleinberg algorithm on the nodes of the pruned n-graph 265 to determine useful hub 
and authority pages. For each node of the pruned n-graph 265, we measure two 
scores: a hub score (HS), which estimates how good a hub the page is, and an 
authority score (AS), which estimates how good an authority the page is. The intuition 
behind our method is this: a good hub is one that points to many documents. A good 
authority is one that is pointed to by many documents. Transitively, an even better hub 
is one that points to many good authorities, and an even better authority is one that is 
pointed to by many good hubs (Column 7, lines 41 - 50), compare with the document- 
level analysis includes the identification of pages forming a chain of progression 
links. 

25. Regarding dependent claims 18 and 48, Bharat et al. teach that after we have 
constructed the nodes 212, we add the directed edges 213. If a link points to a page 
that is represented by a node in the graph, and both pages are on different servers, 
then a corresponding edge 21 3 is added to the graph 211. Nodes representing pages 
on the same server are not linked. This prevents a single Web site with many self- 
referencing pages to unduly influence the outcome. This completes the n-graph 211 
(Column 5, lines 13 - 20), compare with the similarity includes the location at which 
the page is stored, and the document-level analysis includes the identification of 
pages linked to by the same tables of contents. 
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26. Regarding claims 1 - 6, 10 and 12 - 14, the claims incorporate substantially 
similar subject matter as claims 37 - 41 and 46 - 48 and are rejected along the same 
rationale. 

27. Regarding claims 26 - 30, 35 and 36, the claims incorporate substantially 
similar subject matter as claims 37 - 41 and 46 - 48 and are rejected along the same 
rationale. 

28. Regarding dependent claim 15-17, Bharat et al. teach that we use do iterative 
connectivity analysis 310, content analysis 320, and pruning 330. This method consists 
of a sequence of rounds. In each round, our modified connectivity analysis is run for 10 
iterations to get a listing of the (current) best hubs and authorities 315. In step 320, the 
pages are examined for content similarity in decreasing order of rank, alternating 
between the hub and the authority list. Less relevant pages are pruned (Column 8, 
lines 25 - 33), compare with the document-level analysis includes identifying the 
pages listed in a table of contents, the document-level analysis includes 
identifying as part of the document the page containing the table of contents, the 
document-level analysis includes the similarity of candidate pages. 

29. Claims 21 and 22 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Bharat et al. (US 61 12203 A) and Earl (US 5924104 A) as applied to claims 1 - 6, 
10, 1 2 - 1 8, 26 - 30, 35 - 41 , and 46 - 48 above, and further in view of Huang et al. 
(US 6601075 B1). 
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30. Regarding dependent claims 21 and 22, neither Bharat et al. nor Earl teach 
the similarity includes similar style specifications, and the similarity includes 
similar page layout. Huang et al. teach that the HITS and CLEVER algorithms make 
use of hyperlinked structures to rank documents that share the same schema. 
Exemplary documents with hyperlinked structures are HTML documents. XML has 
given rise to a new hyperlink environment that includes documents with different 
schemas. In this environment it will become increasingly important to identify high- 
quality schemas and documents that correctly use them. Hence, this new environment 
presents several previously unaddressed issues: ranking documents based on the 
quality of their associated schema, determining the quality of the schemas themselves, 
and ranking documents based on their structural properties (e.g. validity, well- 
formedness, etc.). The WWW today calls for a system that finds and identifies 
authoritative XML-documents that take these factors into account. This need, which 
makes use of the new dimension added by XML, has heretofore remained unsatisfied 
(Column 3, lines 37 - 53), compare with the similarity includes similar style 
specifications, and the similarity includes similar page layout. It would have been 
obvious to one of ordinary skill in the art at the time of the invention to combine the 
combined invention of Bharat et al. and Earl with that of Huang et al. because such a 
combination would allow the users of Bharat et al. and Earl the benefit of an algorithm 
which is applied to an initial set of documents, similar to the HITS and CLEVER 
algorithms (Column 3, line 66 - Column 4, line 1 ). 
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31 . Claims 23 and 25 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Bharat et al. (US 61 12203 A) and Earl (US 5924104 A) as applied to claims 1 - 6, 
1 0, 1 2 - 1 8, 26 - 30, 35 - 41 , and 46 - 48 above, and further in view of Law et al. (US 
6754873 B1). 

32. Regarding dependent claims 23 and 25, neither Bharat et al. nor Earl teach 
the similarity includes similar logical structure of the page content, the 
document-level analysis includes analysis of the topological structure of the 
linked pages. Law et al. teach that the link structure of the hyperlinked documents is 
analyzed in order to find hyperlinked documents that are related to and at the same 
level of generality of a hyperlinked document (Column 2, lines 8-11), compare with the 
similarity includes similar logical structure of the page content, the document- 
level analysis includes analysis of the topological structure of the linked pages It 
would have been obvious to one of ordinary skill in the art at the time of the invention to 
combine the combined invention of Bharat et al. and Earl with that of Law et al. because 
such a combination would allow the users of Bharat et al. and Earl the benefit of 
innovative techniques for finding related hyperlinked documents using link-based 
analysis (Column 2, lines 6 - 8). 

33. Claims 7 - 9, 1 1 , 1 9, 20, 24, 31 - 34, and 42 - 45 are rejected under 35 
U.S.C. 103(a) as being unpatentable over Bharat et al. (US 6112203 A) and Earl (US 
59241 04 A) as applied to claims 1 - 6, 1 0, 1 2 - 1 8, 26 - 30, 35 - 41 , and 46 - 48 
above, and further in view of Prince (US 6877002 B2). 
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34. Regarding dependent claims 7 - 9, 1 1, 19, 20, 24, 31 - 34, and 42 - 45, 

neither Bharat et al. nor Earl explicitly teach meta-data or image. However, Prince 
teaches that the parsed results (from step 42 in FIG. 4) relating to the media are passed 
to extraction agent 68 via an extraction queue 67. Results not associated with the 
media are not pursued. The extraction queue 67 comprises URLs to be analyzed with 
respect to associated media metadata. The extraction queue 67 may comprise 
metadata queue entries such as media URLs, Web page URLs, Web page titles, Web 
page keywords, Web page descriptions, media title, media author, and media genre. 
Each queue entry added to the extraction queue is assigned a processing time and a 
priority. In an exemplary embodiment of the invention, each queue entry is given a 
processing time of "now" and the same default priority. The iterative seeding process 
increases the number of queue entries added to the extraction queue 67 (Column 7, 
lines 23 - 37), compare with the similarity includes the similarity of meta-data 
associated with the page, the meta-data includes the author identification, the 
similarity includes the presence of at least one similar content item on each page, 
the class of content item is a class of image, the class of image is an image 
containing a directional symbol, a textual clue is obtained for the class of image, 
the contextual clue is the presence of at least one other hyperlink nearby with the 
candidate document page. It would have been obvious to one of ordinary skill in the 
art at the time of the invention to combine the combined invention of Bharat et al. and 
Earl with that of Prince because such a combination would allow the users of Bharat et 
al. and Earl the benefit of A method for querying metadata associated with media on a 
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computer network includes separating the metadata into keywords (Column 2, lines 37 
-39). 

Response to Arguments 

35. Applicant's arguments with respect to claims 1 - 48 have been considered but 
are moot in view of the new ground(s) of rejection. 

Conclusion 

36. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 

§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Nathan Hillery whose telephone number is (571 ) 272- 
4091. The examiner can normally be reached on M - F, 10:30 a.m. - 7:00 p.m. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Heather R. Herndon can be reached on (571 ) 272-4136. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 




Doug Hutton 
Primary Examiner 
Art Unit 21 76 
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